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OPTIMAL  PROBLEM-SOLV'NG  SEARCH- 
ALL-OR-NCNE  SOLUTIONS 

Herbert  A.  Simon 
and 

Joseph  B.  Kadane 
Carnegie-Mellon  University 


ABSTRACT 


Optimal  algorithms  are  derived  for  satisficing  problem-solving  search,  that  is, 
search  where  the  goal  is  to  reach  any  solution,  no  distinction  being  made  among 
different  solutions.  This  task  is  quite  different  from  search  for  best  solutions  or 
shortest  path  solutions. 

Constraints  may  be  placed  on  the  order  in  which  sites  may  be  searched.  This 
paper  treats  satisficing  searches  through  partially  ordered  search  spaces  where  there 
are  multiple  alternative  goals. 


(1)  Simon  was  supported  in  pai  t by  Research  Grant  MH-07722  from  the  National 
Institute  of  Mental  Health  and  in  part  by  the  Advanced  Research  Projects  Agency  of  the 
Office  of  the  Secretary  of  Defense  (P44620-7  3-C-0074)  which  is  monitored  by  the  Air 
Force  Office  of  Scientific  Research;  Kadane  was  supported  in  part  by  the  Office  of  Naval 
Research  under  contract  number  NOCOi 4-67-A-0314-002. 
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OPTIMAL  PROBLEM-SOLVING  SEARCH: 

ALL -OR -NONE  SOLUTIONS 

Herbert  A.  Simon 
and 

Joseph  B.  Kadane 
Carnegie-Metion  University  (1). 

In  the  representation  of  problem  solving  as  a search  through  a tree  or  directed 
graph,  several  different  cares  must  be  distinguished.  In  one  case  (Best-Value  Search), 
values  are  associated  with  terminal  nodes,  and  the  aim  of  the  search  is  to  discover  the 
node  bearing  the  highest  value.  This  case  is  only  interesting  if  information  becomes 
available  during  the  course  of  the  search  which,  by  excluding  some  portions  of  the 
search  space,  makes  exhaustive  search  unnecessary. 

In  the  second  case  (Shortest -Path  Search),  which  can  be  treated  as  a special  case 
of  the  first,  the  value  associated  with  a terminal  node  is  the  length  of  the  shortest  path 
from  the  starting  point  to  that  terminal,  and  the  8im  is  to  find  the  terminal  with  the 
smallest  value  (i.e.,  the  lerminel  closest  to  the  starting  point).  In  either  the  first  or 
second  case,  a further  requirement  may  be  imposed  on  the  search  algorithm  that  it  be 
the  algorithm  for  finding  the  best  value  or  shortest  path,  as  the  case  may  be,  that 
minimizes  the  expected  search  effort  for  attaining  its  goal. 

In  a third  case  (Satisficing  Search),  there  is  a designated  subset  of  terminal  nodes 
called  goals,  and  the  aim  of  the  search  is  to  reach  any  of  these  goals.  No  distinction  is 
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made  between  the  values  of  different  goals  — all  goals  are  equally  desirable.  In  one 
variant  of  this  case,  there  is  a single,  unique  goal,  and  only  one  paih  leading  to  it.  In 
either  variant  of  this  third  case,  we  are  interested  in  search  algorithms  that  minimize 
the  expected  search  effort  for  reaching  the  first  goal  (or,  it  the  goal  is  unique,  tor 
reaching  that  goal).  It  is  this  third  case  which  is  the  subject  ot  the  present  paper. 

Tasks  of  all  of  these  three  types  are  common  in  the  literature  of  artificial 
intelligence.  Game-playing  programs  are  concerned  with  discovering  "best"  moves, 
hence  involve  best -value  search.  Programs  for  solving  certain  scheduling  proolems  — 
for  example,  the  Traveling  Salesman  problem  --  search  for  shortest  paths.  Theorem- 
proving programs,  however,  and  most  problem-solving  programs  are  concerned  with 
satisficing  search. 

Search  algorithms  designed  to  handle  tasks  of  the  different  types  may  need  to  be 
vastly  ciifferenl,  and  the  search  effort  required  to  nnd  soiuiions  may  respond  to  quite 
different  parameters  of  the  task  environments.  Suppose,  for  example,  that  needles  of 
varying  sharpness  have  been  distributed  randomly  throughout  a haystack  of  size  E-  A 
best-value  search  algorithm  designed  to  find  the  sharpest  needle  in  the  haystack  will 
have  to  search  the  entire  stack,  and  will  require  an  effort  proportional  to  E A 
satisficing  algorithm  to  find  a needle  sharp  enough  tor  sewing  will  only  have  to  search 
until  it  discovers  one  such  needle.  Its  expeded  search  effort  will  be  inversely 
proportional  to  the  average  density  of  "sharp  enough"  needles  in  the  stack,  and 
independent  of  £. 

If  the  needles  a<e  not  distributed  with  uniform  density,  then  the  kind  of 
information  about  the  distribution  that  would  be  useful  to  guide  a best-value  search 
may  be  quite  different  from  the  information  that  would  be  useful  to  guide  a satisficing 
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search.  Intuitively,  one  can  see  that  tor  best-value  search  it  would  be  helplul  to  be 
able  to  set  an  upper  bound  on  the  sharpness  ot  the  needles  to  be  found  in  any 
particular  region.  Tor  satisficing  search,  one  would  want  to  know  the  probability  of 
finding  a sharp-enough  needle  in  any  given  region.  From  this  simple  example,  therefore, 
we  see  that  we  need  separate  theories  of  search  algorithms  for  best-value  and 
satisficing  search,  respectively. 

Several  cases  of  satisficing  search  have  already  been  treated  in  the  literature. 
These  include  both  the  case  where  sites  may  be  searched  in  any  order,  without 
constraint,  and  the  case  where  sites  are  partitioned  into  a number  of  classes,  and  there 
is  a specified  order  in  which  sites  in  each  class  must  be  searched  (but  no  bctween-class 
constraints).  We  call  this  case,  "parallel  search." 

I r,  tbs  literature  of  artificial  intelligence  (Chang  and  Slagle,  1971;  Kowalski,  1969, 
1972;  Nilsson,  1971;  Pohl,  1971),  algorithms  are  to  be  found  for  best-value  and 
shortest-path  search  through  trees.  A recent  paper  by  Garey  (1973)  provides  an 
algorithm  for  satisficing  search  through  trees,  but  neither  the  algorithm  nor  its  method 
of  derivation  encompasses  general  partial  orderings.  It  is  the  purpose  of  this  paper  to 
fill  this  gap  by  extending  our  results  to  satisficing  searches  where  the  ordering 
constraints  are  typical  of  those  in  problem-solving  tasks:  that  is,  to  searches  in  partial 
orderings. 

In  the  first  two  sections,  we  will  review  the  optimal  search  algorithms  for 
unconstrained  and  parallel  (satisficing)  searches,  respectively.  In  the  third  section,  we 
will  extended  our  results  to  satisficing  searches  through  partially  ordered  search  spaces 
in  the  case  where  there  are  multiple  alternative  goats. 

1.  Satisficing  Search  Without  Ordering  Constraints 
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An  unknown  number  of  chests  of  Spanish  treasure  have  been  buried  on  a random 
basis  at  some  of  tl  sites,  at  a known  dep  h of  three  feet.  For  each  site  there  is  a known 
unconditional  probability,  pii),  i«l,...,a  that  a chest  was  buried  there,  and  the  cost  of 
excavating  site  i is  fl(L). 

A strategy,  1,  is  « permutation  of  any  subset  of  the  integers  from  1 to  a 
Suppose  a subset  of  sites  is  searched  in  the  order  given  by  L under  the  condition  that 
the  search  is  terminated  as  soon  as  one  treasure  is  found,  then  we  can  associate  with 
the  strategy  t an  expected  cost  Ml)  of  this  terminating  search,  and  a probability  £(t) 
that  a treasure  will  be  found.  £(L)  ■ 1-S.(D,  where  £(D  is  the  probability  that  there  is 
treasure  at  none  of  the  sites  of  L We  assume  that  ¥(t)  > 0 and  £(t)  > 0 for  all  t That 
is,  no  site  can  be  excavated  without  cost  and  no  site  contains  a treasure  with  certainty. 

Let  (at)  be  the  strategy  consisting  of  executing  strategy  a.  followed  by  b,  where 
the  subsets  of  a and  b are  non-overlapoing.  Then  by  our  definitions,  we  have: 

<l.l)  V(i)  « q(i), 

where  i is  the  strategy  of  excavating  the  ith  site. 

(1.2)  V(ab)  « V(a)  ♦ $(a)V(bj 

(1.3)  S(ab)  ■ S(a)S(b). 

Equation  (!.2)  states  that  the  expected  cost  of  te'min.iting  search  over  (ab)  is 
the  expected  cost,  ¥(a).  of  a terminating  search  over  a plus  the  product  of  the  expected 
cost  of  a terminating  search  over  b by  the  probability,  £(i),  that  the  latter  search  is 
necessary  (i.e.,  that  treasure  was  not  found  in  a).  Equation  (1.3)  states  that  the 
probability  of  not  finding  a treasure  in  (ab)  is  the  product  or  the  probabilities  of  not 
finding  treasures  in  a and  b,  respectively. 

The  functional  equations  (1.2)  and  (1.3)  are  studied  t Kadane  (1969) 
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It  is  evident  that  S is  associative  and  commutative,  so  that  SUtbX)  ■ b(#(b£))  and 
S(ai2>  “ S<ba);  while  y is  associative,  but  not  commutative.  Defining  fij  - and 

S(Aq)  - S(a())  ■ 1,  we  find  readily  that: 

(1.4)  V(Ar)  - Vta,,*^  ,ar) 

r i-L 

« E n S(a|)V(aj> 
i-l  j=0 

r 

- E StAj.jMaj) 
i-l 

We  also  note  for  later  reference  that: 

(1.5)  P(ab)  - P(a)  ♦ P(b)  - P(a)P(b)  - P(a)  ♦ S(a)P(b). 

We  consider  now  the  effect  upon  the  expected  cost  of  search  of  excavating  the 
same  set  of  sites  but  in  different  orders  --  by  strategies  (tb&D  and  (atfaal),  say,  where 
g and  tf  may  be  empty,  S(&),  with  g :mpty,  equals  1,  and  ¥(g),  with  g empty,  equals  zero. 
(16)  V(abcd)  - V(acbd) 

- V(a)  ♦ S(a)V(bcd)  - V(a)  - S(a)V(cbd) 

(1.7)  - S(a)[V(b  ;)  - V(cbd)] 

(1.8)  - S(a)[V(bc)  + S(bc)V(d)  - V(cb)  - S(cb)V(d)] 

But,  since  Sdit)  ” S<tb).  Equation  (1.8)  simplifies  to: 

(1.9)  V(abcd)  - V(acbd)  - S(a)[V(bc)  - V(cb)J 

(1.10)  - S(a)[V(b)P(c)  - V(c)P(b)] 

In  particular,  if  b and  c.  consist  ol  the  single  sites  i and  j,  respectively,  then  it  will 
be  cheaper  to  excavate  i before  i iff  $(i)  > (9(i),  where  $d)  - E(L)/¥(t.)  Moreover,  this 
result  holds  for  all  i and  j.  The  optimal  strategy,  therefore,  for  finding  a single  treasure 
is  to  excavate  sites  in  descending  order  of  p(i)  until  a treasure  is  discovered. 

Similar  results  were  obtained  by  Dean  (1965),  Kadane  (1969),  Joyce  (1971),  and 
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2.  Search  With  Parallel  Ordering 

Suppose,  now,  that  the  Spanish  treasuies  are  buried,  as  before,  but  that  neither 
the  sites  nor  the  depths  of  burial  are  Known  with  certainty.  At  each  site  a sequence  of 
one-foot  slices  can  be  excavated,  and  a treasure  may  be  disclosed  by  the  removal  of 
any  one  of  these  slices.  The  probability  that  a treasure  lies  just  below  any  specified 
slice  is  Known.  Designate  the  probability  that  the  treasure  lies  below  slice  1 of  site  i as 

*<a>. 

A particular  slice  (bg)  car.  only  be  searched  after  all  the  other  slices  above  it,  (i,l), 
L < g,  have  been  searched.  Hence,  an  admissible  search  strategy  will  be  an  ordering  of 
a subset  of  slices  such  that  (tgl  does  not  precede  (j,[)  if  g > t For  a given  strategy,  let 
UuD  he  the  order  number  of  the  slice  (jJJ. 

Now,  we  can  define  quantities,  )U\),  E(D,  and  Sit)  exactly  as  before,  so  that 
Equations  ,1.1)  through  <1.!G)  are  again  valid  for  the  admissible  strategics.  VM*  wish  to 
find  the  strategy  thal  minimizes  ¥<L>  where  t ranges  Over  permutations  of  the  entire  set 
of  q integers  subject  to  the  order  constraint  that  Kul)  > Kug)  if  g > t 

A bloc  cf  slices  is  a set  of  slices  belonging  to  the  same  site  that  are  consecutive 
in  that  site.  (The  members  of  a bloc  need  not  be  consecutive  in  any  particular  strategy, 
since  they  may  be  interspersed  with  one  or  more  slices  from  other  sites.) 

In  a strategy  (abc).  b is  (weaXly)  monotonic  decreasing  if  for  each  pair  of 
segments,  with  order  numbers  j,  i in  b,  pit)  5 Pit)  if  i < i-  The  substrate®1/  b is 
monotonic  increasing  if,  for  each  j,  ± in  the  sequence  P4)  i PU)  if  i < j. 

In  a strategy  (abed),  b and  c are  interchangeable  if  permuting  the  strategy  into 
(atbd)  does  not  violate  the  oidei  constraints  (i.e  , does  produce  another  strategy). 

Theorem  2 1:  In  (ab£ii)i  b and  g are  interchangeable  iff  no 
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site  represented  by  one  or  more  slices  in  b is  represented  by  any  slices  in 

£. 

Proof:  The  order  constraints  apply  to  pairs  of  slices  belonging  to 
the  same  site  Under  the  conditions  of  the  theorem,  interchange  of  b with  c 
will  not  reverse  Ihe  order  of  any  pair  of  slices  belonging  to  the  same  s>te, 
hence  will  not  violate  any  order  constraints. 

Conversely,  if  slice  t(uD  *n  b and  slice  l<ui>  w C.  belong  to  the  same 
site,  then  the  order  constraints  require  that  [ < j (since  t(Lt)  < t(ul))  W, 
in  (cb)  we  will  have  Ki,t)  > wnich  violates  the  constraints. 

Q E 0. 

Theot  em  2.2.  If  b and  c.  are  interchangeable  m (abed),  and  if 

9(t)  * P(k).  then  the  strategy  can  be  improved  by  interchanging  b and  j* 

hence  iv  not  Optimal 

Proof:  Using  Equation  (1.10),  we  have: 

(2.1)  V(abcd)  - V(acbd)  - S(a)[V(b)P(c)  - V(c)P(b)J 

(2.2)  - S(»/'/{b)V{c)[P(c)  - 9(b)]  > 0. 

Q E.  D. 

Theorem  2.3:  If  b and  d are  strategies  tnat  are  also  consecutive 

blocs  o(  Site  l with  d following  bj  »nd  i*  ^td>  * 9(b>.  then  a strategy  ot 

(abede)  with  £ non-null  is  not  optimal. 

Proof:  8y  the  ordering  constraints,  b cannot  precede  b Suppose  it 
does  not  follow  immediately  (c  is  non-nuH)  Since  (fad)  :s  a bloc,  r*o  member 
of  i,  belongs  to  site  g for  all  other  slices  of  i must,  by  tlie  ordering 
constraints,  precede  b or  follow  d Either  9 (g)  > 9(b).  or  9<t)  * 0(b)  < 9^(*)- 
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In  the  firs*  case,  the  strategy  can  be  improved  (Theorem  2.2)  by 
interchanging  t wilh  fc,  heme  is  not  optimal  tn  the  second  case,  the 
strategy  can  be  improved  by  interchanging  (,  with  d,  hence  is  not  optimal. 

0 E D 

We  caii  the  bloc  (fed)  ot  Theorem  23  an  mdivisiblp  falo:.  Each  indivisible  bloc  is 
made  up  ct  consecutive  slices  from  a single  site,  and  its  indivisibility  depends  only  on 
the  0 s of  strategies  from  that  site.  Hence  we  can  now  proceed,  for  each  individual  site, 
to  determine  *ts  Mamma!  i> .divisible  blocs  by  joining  blocs  that  satisfy  the  conditions  of 
Theorem  2 3 until  the  0 s for  all  the  separate  blocs  the*  remain  are  monotonic 
JcOfcrtsuifc  We  ran  Male  bus  resuil  as  a corollary; 

Corollary  2.3  1;  An  optimal  solution  consists  of  a sequence  of 
mammai  indivisible  blocs  such  that  the  0's  of  the  successive  blocs  0*  arty 
given  site  are  monotomc  decreasing 
We  are  now  read/  for  the  mam  theorem. 

Theorem  2 4;  If  a strategy  consists  of  a sequence  of  mammal 

indivisible  blocs,  and  it  the  0's  associated  with  these  blocs  are  monotonic 

decreasing,  then  the  strategy  is  optimal 

Proof;  1 Corollary  2.3.1  guarantees  that  any  optimal  strategy  must 
consist  of  a sequence  of  mammal  indivisible  blocs,  ar.d  that  the  P’s  cf  the 
subsequerce  ol  blocs  belonging  to  any  given  si*e  are  monotonic  decreasing. 

2 The  strategy  defined  m Theorem  2 4 is  ui  que  up  to  trivial 
interchanges  of  segments  with  equal  0,  which  do  not  change  the  value  of 

3 The  oniy  allowable  permutations  that  preserve  the  maximal 
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indivisible  blocs  and  the  < order  involve  interchanges  of  strategies 
telongmg  !o  different  sites  and  not  separated  by  a strategy  belonging  to 
either  of  their  sites 

4 Suppose  a strategy  is  optimal,  but  that  0 is  not  monotonic 
decreasing  Consider  the  first  instance  where  0(b)  > 0(g).  with  a 
immediately  preceding  b Since  the  blocs  belonging  to  a given  site  are 
monotonic  decreasing  in  value,  a and  b must  belong  to  different  sites,  hence 
are  interchangeable,  by  Theorem  2 1 Therefore,  by  Theorem  2.2,  the 
strategy  would  be  improved  by  interchanging  a and  tb  contrary  to  the 
hypothesis  that  the  strategy  is  optimal 

0-  E D. 

Theorem  2 4 tells  us  that  the  optimal  strategy  m digging  for  ciouhioons  is  to 
calculate  the  average  yield  (per  fool  of  digging)  for  each  maximal  indivisible  bloc  of  a 
site,  then  excavate  the  successive  blocs  in  decreasing  order  of  yield. 

The  evaluation  functions  0,  take  into  account  not  only  the  potential  return,  c.(;), 
and  cost,  Cm/,  from  the  slice  being  executed,  but  the  prospective  value  of  getting  closer 
to  underlying  slices  that  have  larger  0 values  than  the  current  slice.  This  characteristic 
of  the  evaluation  function  adds  a certain  "depth-first"  tendency  to  the  strategy.  For 
example,  suppose  that  >t  is  known  that  the  treasure  is  buried  not  less  than  five  feel 
below  the  gi  ound.  Then,  if  the  optimal  strategy  calls  for  excavation  to  begin  at  the  it h 
site,  it  will  continue  at  that  site  until  the  excavation  has  reached  a depth  of  at  least  five 
leet. 

Jn  the  special  case  in  which  g?i  implies  0(i,g)  < 0(ut).  Kadane  (1969)  finds  the 
optimal  ordering  to  be  according  to  0.  The  results  of  this  section  are  new  when  the 
above  nndition  is  not  sab-died. 
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3.  Search  Through  a °artially-Ordered  Space 

Our  next  task  is  tc  extend  the  results  of  the  previous  section  to  a search  through 
a partiaily-ordered  set  of  nodes,  or  cycle-free  graph,  which  includes  the  familiar  case  of 
search  througn  a tree.  Theorem-proving  and  problem-solving  searches  are  commonly 
representable  as  searches  through  trees  Or,  more  generally  through  partial  orderings. 
In  such  a search,  a new  node  is  obtained  by  applying  on  operator  to  branch  from  some 
node  reached  previously.  In  this  section,  we  will  prove  a theorem  {Theorem  3.1)  for 
optimal  search  through  a partial  ordering  which  is  analogous  to  Theorems  2.3  and  2.4 
for  parallel  orderings.  As  before,  the  key  role  is  played  by  an  evaluation  function,  P", 
which  can  be  assigned  to  each  branch  at  each  node  already  reached  in  such  a way  that 
it  is  always  optimal  to  search  next  the  branch  with  largest  <f>". 

The  proof  of  optimality  for  a partial  ordering  is  a great  deal  more  complex  than 
the  proof  for  a parallel  ordering,  mainiy  because  j)  for  a node  now  has  to  be  maximized 
over  all  the  alternative  sequences  descendant  from  that  node.  The  notion  of  the  “best 
set"  of  a node  (the  set  of  nodes  descendant  from  that  Rode  for  which  0 is  maximum) 
replaces  the  "maximum  indivisible  bloc"  of  the  previous  section.  The  Q"  of  Theorem  3.1 
is  this  best  set. 

As  before,  we  assume  that  for  each  node,  i,  of  the  set  a value  eU)  is  given, 
representing  the  probability  that  a solution  will  be  found  at  that  node.  The  cost  of 
excavating  each  node  from  one  immediately  before  i 

A strategy  t(C"  for  a set  of  nodes,  Q,  . ’dering  of  the  nodes  of  D that 

satisfies  the  order  constraints  or.  those  nodes.  snd  Q be  two  mutually  exclusive 

sets  of  nodes,  and  £ their  set  sum.  The  £ and  0 are  interchangeable  iff  there  exists  a 
strategy  c 55  fail)  and  a strategy  c.”  = ltz''a"),  where  t and  t"  are  strategies  on  £,  a and 
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i"  strategies  on  A,  and  h and  b"  strategies  on  2-  Clearly,  if  A and  B are 
interchangeable,  if  a is  any  strategy  of  A and  if  b is  any  strategy  of  Q,  then  (ab)  and 
(ba)  are  strategies  of  C- 

Corresponding  to  the  notion  of  a bloc  in  the  previous  section,  we  introduce  the 
concepts  of  initial  and  terminai  blocs  of  a set  of  nodes.  Let  A tie  a partially  ordered  set 
of  nodes,  and  let  it  contain  Q and  Q « A~B  Then  B is  an  initial  bloc  of  A iff  there  exist 
strategies  h on  0 and  t on  Q such  that  a ■ (bi.)  is  a strategy  on  A g is  a terminal  bloc 
of  A iff  there  exist  strategies  t on  g and  t on  C such  that  a ■*  (eg)  is  a strategy  on  A- 

As  before,  we  can  define  for  each  strategy  the  quantities  ¥(t),  Ed),  £(t)  and 
0(t)  - E(t)/¥(t).  all  of  which  depend  only  on  the  subset,  Q,  and  its  ordering,  t, 
independently  of  the  remaining  nodes  <n  the  entire  set.  Note  that  Efl>  and  S(L)  are 
constant  over  all  strategies  of  a given  set,  Q,  hence  are  functions  of  Q;  while  and 
(J(t)  depend  upon  the  strategy,  L as  well  as  the  set,  Q. 

A strategy  of  a set  of  nodes,  Q,  for  which  p assumes  its  greatest  value  for  thal 
set  will  be  called  a best  strategy  of  Q and  will  be  designated  by  i*(Q)  and  its  ¥ by  V’(D>- 
An  initial  bloc,  Q,  of  set  X for  which  p’(Q)  is  maximal  over  all  initial  blocs  of  X will  be 
called  a best  sef  of  Xj  and  the  (5  of  its  best  strategy  will  be  designated  by  P"(I). 

We  now  prove  four  lemmas  that  are  needed  tor  our  main  theorem. 

Lemma  3.1.  Let  A -and  AL>2  be  initial  sets  of  I such  that:  (1) 

A is  the  best  set  of  L with  best  strategy  t’(A)  “ Sj  and  (2)  b is  any 
strategy  for  g. 

Then,  (4"  > p(b). 

Proof.  Since  A is  the  best  set  of  L P"  “ P(a)  i P(ab)- 
Now,  using  Equation  1.5, 
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(3.1)  V(ab)0(ab)  = P(ab)  “ P(a)  + S(a)P(b) 

(3.2)  «V(a)0(a)  ♦ S(a)V(b)0(b) 

But  V(ab)0(a)  > V(ab)0<ab),  so  that, 

i (3.3)  V(ab)0(a)  > V(a)0(a)  ♦ S(a)V(b)0(b> 

t 

[ Expanding  the  left-hand  side,  we  get: 

(3.4)  [V(a)  + S(a)V(b)]0(a)  > V(a)0(a)  ♦ S(a)V<b)0<b) 

(3.5)  S(a)V(b)0(a)  £ S(a)V(b)0(b>. 
so  that,  since  we  have  postulated  that  S(a)  / 0 and  V(b)  0, 

(3.6)  0(a)  - 0"  > 0(b) 

0.  E.  D. 

Lemma  3.2.  Let  A be  a set  consisting  of  the  mutually  exclusive 
subsets  of  nodes  B.  CL  and  Q,  where  8 is  an  initial  bloc  of  A.  while  C and  Q 
are  interchangeable,  hence  also  both  terminal  blocs.  Let  the  best  strategy, 
t’(A)  be: 

t’(A)  “ <ii£.l{li...cKdK), 

where  b is  a strategy  for  B,  t " (Li  -C^)  is  ■ strategy  for  G and 
(1  - (di-dfc'  is  a strategy  for  Q. 

Then  0(tj)  > 0(flj)  > ...  > 01^)  > 0(dfc) 

Proof:  Suppose  0(dj)  < 0(£^j).  Then,  by  Equation  (1.10),  t’(A)  could 
be  improved  by  exchanging  d,  and  C^fj,  contrary  to  the  hypothesis  that 
0(A>  is  maximal.  Bui  the  exchange  is  admissible,  since  Q and  Q are 
interchangeable.  Similarly,  the  supposition  that  0(Cj)  < 0(dj)  leads  to  a 
contradiction. 

Q.  E.  D. 
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Lemma  3.3.  Given  A,  EL  Q and  Q as  in  Lemma  3.2,  with 
£ * (£i— tv  and  d = (dj  dfc),  suppose  thai  A is  a best  set  of  I,  sc  that 
0’(A)  - 0"  Then:  0(d)  2 0’(A),  and  therefore  0’(Q>  2 0’(&)  » 0*. 

Proof:  Define, 

£ - (bijdi-  £*) 

Then, 

B(£)*S(£)E<dK) 

0*  • 

We^SteMd*) 

If  0^)  - E<d*)/¥(d*)  < 9",  then  0(g)  > 0".  But  £ is  a strategy  for 
an  initial  bloc  of  A,  and  also  of  L Since  0’  is  maximal  over  all  such  blocs, 
the  inequality  is  a contradiction.  Therefore  0(d^)  2 0". 

Bu'i,  by  Lemma  3.2, 

£{:!*;  £ *(£fc>  5 s p(iii)  si  p(c  j i 

E(dj  >*S(di  >E<dp>+-+S(di ) • Std*.  i >»(!*) 

0(d)  - — — 

M<d1)*S<dl )V(d2>*  >S(di )...£<d*_ 1 
But,  by  definition,  0(0  > 0(d),  whence, 

0TQI 2 0' 

Q.  E D. 

Lemma  3.4.  Let  (aid)  and  (ftt"d)  be  strategies  over  the  same  set 
of  nodes.  Then, 

(3.7)  V(acd)  - V(*c“d)  » S(a)[V(c)  - V(c“)] 

Proof: 

(3.8)  V(acd)  - V(ac"d)  <=  V(a)  ♦ $(a)V(cd)  - V(a)  - S(a)V(c’d) 

(3.9) 


(3.10) 


- S(a)[V(c)*S(c)V(d)-V(c“)-S(c)V(d)J 

- S(a){V(c)  - V(c")] 
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Q.  E.  D. 

In  order  to  state  and  prove  the  main  theorem,  we  need  to  introduce  some 
additional  notation.  Let  and  (bj—  b^)  be  strategies.  Define  A'j  as  the  best 

permutation  of  (aj-  .a,.),  B,  -(bj-b,),  C,  - <12^ -b^ai ).  Sj>.  A’*  as  the  best 

permutation  of  (at+j~  ar),  B*  “ (ti+i  -bf).  C*  * (bj  + iai  + l-brM*  A*  * <8j+]A**i_,' 

Define  Cq  m Q*r  = &q  = A%  “ Bq  “ B*r  ” the  null  strategy,  with  £(X  ) « 1, 
¥<X)  * 0 

Consider  a strategy  over  the  set  L having  the  form  (lg),  where  l and  £ are 
strategies  over  the  non-overlapping  sets  £ and  G,  respectively.  Let  0“  be  the  maximum 
of  0 over  all  strategies  of  Q,  and  let  Q"  be  an  initial  bloc  of  fi,  end  HQ")  » strategy  for 
D"  such  that  0<HQ")  « 0”.  Finally,  let  b be  the  initial  segment  of  g,  consisting  of  t’(Q") 
possibly  interspersed  with  other  nodes  of  G not  belonging  to  2",  end  having  the  last 
element  of  HD”)  as  its  last  element.  We  now  prove  the  theorem: 

Theorem  3.1:  If  h contains  any  nodes  not  belonging  to  ft",  then 
its.)  can  be  improved  (weaKly)  by  moving  these  "intruding"  nodes  beyond 
the  last  node  of  B",  that  is,  by  bringing  the  nodes  of  Q"  to  the  front  of  h 
with  the  remaining  nodes  of  h following  them. 

Stating  the  theorem  in  the  notation  previously  introduced,  we 
designate  t’(Q")  by  ^ - (a^  4f).  and  b by  C,.  ■ {bjaj  -braf),  so  that 
Br  " <bj  b,-)  is  a strategy  on  the  intruding  nodes.  Finally,  we  define  m such 
that  g » c^-m-  By  definition  of  Q',  A’*o  ■ &*q.  The  permutation  of  h» 
asserted  by  the  Theorem  to  be  an  improvement,  is  then 

h"  - </W  - A*0B*0- 

H yttg)  is  infinite,  there  is  nothing  to  prove.  On  the  other  hand,  if 
¥(tg)  is  finite,  then 
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<3.1 1)  VOg)  - V(f)  ♦ S(f)V(Cr)  ♦ S(f)S<Cf)V(m)  < o> 
so  that  y<0,  WQf)  and  are  ail  finite.  Now, 

(3.12)  V(fA*0B*0m)  - V(f)  * S(f)V(A*0B*0)  ♦ S(»)S<A*0B*0)V<m) 
Now  £ is  commutative,  so  5<A*qB*q)  “ £<£*.)  Then, 

(3.13)  V(fg>  - V(fA*08‘0m)  - $(f)(V(Cr)  - V(A*QB*0)]. 
Therefore,  we  wish  to  prove  that: 

(3.14/  V(Cr)  - V<A*0B*0)  > 0. 

Note  that  a's  may  be  advanced  forward,  interchanging  them 
with  b’s,  since  Q"  is  an  initial  bloc  of  bl 

Proof:  By  identity,  and  remembering  that 
A%  - B%  - X , A’*0  - A*0. 

r 

(3. IS)  V(Cr)  - V(A*0B*0)  - Z [V<CjA  *iB*i)-V^Cj_1A'#i..1BT!j.1)3 

i-1 

Considering  the  individual  terms  of  fne  summation,  we  have: 
<3.16)  V<C( A’^B^-VfCj. j A’*, _ 1 0*j _J > 

-V(C(.lb(A‘.1BV-V<C,.1A-Vib(B*i) 

(3.17)  - V<C,.1b,A*i.]B,,)  V<Ci.ib|A’*j.1B*j) 
♦V<C|_1b1A-V1B,,)-V<C1.lA-V1b|B*,) 

But,  by  Lemma  3 4,  ^<Q,  _ j bj A*j_ j i 1 8^|)  £ 0. 

Therefore, 

(3.18)  V(CjA’*jB*j)  - V(C,A,,j.|B*i.1) 

> V(C,.1bift’*.1B*)- VtC^jA-j^biB*,). 

Applying  Equation  (1.10),  with  C,-j  ■ i» 

- k A'*-i  - £,  fi*  - i 

we  get, 
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(3.19)  V(C(A’*jB*)  - VIC.A'ViBVi) 

2 S(C,  .j)[P(A’*_j)V(bj)  - RbjMA’Vi)].  whence 

(3.20)  V(Cr>  - V(A*QB*0)  2 E S<CI_pV<bi>V(A*V1X0<A’*i_1)-0<bj)> 

i-i 

" T1  “ T2 

where, 

r 

(3.21)  Ti  » E S(Cj_ j )V(bj)V(A’*j_j  j0(A**j_ j ) 

t*l 

r 

(3.22)  T2  - I S(Cj . j )V(b, )V( A’*j . j )0(bj ) 

i-l 

Consider  Ij.  | is  the  best  strategy  of  a terminal  bloc  of  Q". 

Hence,  by  Lemma  3.3,  P(A’*  .j)  2 0*.  so  that 
r 

(3.23)  Tj  > I S(Cj_j)V(bj)V(&’*i_i)0* 

i-l 

Next,  consider  l2  Factoring  S^.j)  - SlBi-iJS^.j), 
in  (3.22),  we  obtain, 

r 

(3.24)  T 2=E  S(B|.1)V(bi)0(bi)S(Aj.1)V(A’,j.1) 

i-l 

Since  V.(A\  ) - 0,  we  have  the  identity: 

(3.25)  S( A- _ | >V{ A'*j_ | ) =E  Z:(  where 

i-l 

(3.26)  Z-  - SlA-.jMA^-.j)  - $(Aj)V(A’*J) 

Using  Equation  (3.25)  in  (3.24),  and  then  changing 
the  order  of  summation,  we  find, 
r r 

(3.27)  J2  -E  E SfBj.jMb.WbjJZ. 

i-l  j-1 
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r 1 

(3.28)  *E  LI  S(Bi_1)P(b,J 
j-l  i-1 


(3.29)  - £ ZjP(Bj)  - E ZjV(Bj)j8(Bj) 

i“l  i-i 

But  satisfies  the  conditions  of  0 of  Lemma  3.1, 
with  &Y  as  & 

Therefore,  by  that  lemma,  JKBj)  5 jS“  Hence, 


(3.30)  T2<[  I Z:V(8:))0" 
i-1 


(3.31) 

S[E  Z;  I SlQ.^Mbi)#- 

j-1  i-l 

r r 

(3  32) 

< 

£ S'.Bj.j)  £ ZjVtbj#* 

i-1  i-1 

r 

(3.33) 

< 

£ $(Bj_1)V(bj)S(Ai_1)V<A’V1¥' 

r 

(3.34) 

5 

£ S(Cj.1)V(bj)V(A’*j.j)^“ 

i-1 


Combining  (3.23)  and  (3.31),  we  have,  finally: 
(3.35)  V(Cr)  - V(A*0B*0)  2 T]  - T2 


* E [S(Cj _ j )V(bj )V(A’ ’ _j  ))(p“  - f)*>  - 0. 
i-1 

Comparing  (3.35)  with  Equation  (3.14),  we  see  that  this  is 
the  result  we  want,  and  the  theorem  is  proved. 

Q.  E.  D. 
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4.  Conclusion 

In  conclusion  we  wi  ,h  to  comment  on  how  our  results  car  be  used  in  constructing 
search  algorithms.  We  will  consider  the  case  of  a search  through  a partial  ordering 
since  search  through  a parallel  ordering  and  independent  search  can  be  regarded  simply 
as  special  cases. 

In  some  applications,  we  will  have,  in  advance,  a map  of  the  enure  search  graph, 
together  with  an  estimate  of  c4>  for  each  of  its  nodes.  In  this  case,  0"  can  be  estimated 
for  each  node  after  determining  the  maximal  indivisible  blocs.  In  other  applications,  the 
search  tree  will  only  evolve  in  the  course  of  the  search  itself.  Then  0"  cannot  be 
determined  from  the  p(i)’s,  but  will  have  to,  somehow,  be  estimated  directly. 

In  the  case  where  s map  of  the  search  graph  is  given  in  advance,  determination 
of  the  0"  may  be  considerably  facilitated  by  using  an  algorithm  described  by  Garcy 
<1973).  This  algorithm  provides  a method,  applicable  to  the  "tree-like"  portions  ot  a 
partial  ordering,  for  reducing  the  entire  set  of  nodes  to  a smaller  set,  essentially  by 
discovering  the  maximal  indivisible  blocs  and  the  optimal  strategies  for  them.  After 
Garey’s  algorithm  has  been  applied  to  carry  the  reduction  as  far  as  possible,  the 
maxima'  indivisible  blocs  for  the  reduced  system  can  be  discovered,  and  the  values  of  0“ 
associated  with  them  computed.  Once  these  values  have  been  found,  or  estimated 
directly  if  the  search  graph  is  not  given  in  advance,  the  algorithm  described  in  the  next 
paragraphs  can  be  used  to  order  the  nodes  optimally. 

Suppose  that,  in  a partial  edering,  we  assign  to  each  branch,  0,  at  each  node,  N» 
an  ordinal  number,  Then,  we  can  construct  a search  algorithm  of  the  familiar 

search-scan  variety,  as  follows; 

Let  <3,^)  designate  branch  g at  node  & arid  let  L he  the  list  of  all  LS.CJ)  pairs 
available  for  search,  ordered  according  to  tiB.ti). 
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1.  Choose  the  first  (BLN)  pair  on  L 

and  generate  the  new  node,  IT; 

2.  If  51'  a goal  node  stop,  else; 

3.  Compute  E(5f\B)  for  all  branches  from  M*, 

and  insert  the  new  (5TB)  pairs  in  their 
appropriate  positions  in  L» 

4 Return  to  Step  1. 

Theorem  3.1  shows  that  if  £<51" >£D is  set  equal  to  the  0"  defined  in  the  text,  then 
the  search  determined  by  the  above  algorithm  is  optimal. 

The  aiue,  JR5I.0),  depends  on  the  probabilities  of  reaching  tho  goal,  p(i),  at  some 
of  the  nodes  that  are  descendants  of  {£•  In  practice,  these  values  of  jj(i)  will  usually  not 
be  known,  and  c m ine  algorithm  wiit  have  to  be  s heuristic  estimate  o{  --  some 

estimate  of  the  "promise"  of  searching  from  5|  in  the  direction  defined  by  B.  Theorem 
3.1  indicates  what  the  nature  of  this  estimate  should  be. 

In  estimating  |T,  we  must  postulate  that  the  search  will  be  continued  through  a 
best  set"  of  nodes,  the  set  of  reachable  nodes  that  maximizes  the  ratio  of  expected 
return  to  cost  of  search.  Search  should  continue  in  One  direction  at  least  as  long  as  this 
ratio  continues  to  increase.,  and  in  fact,  until  it  becomes  lower  than  the  best  ratio  for 
some  other  branch.  By  this  procedure,  depth  of  search  is  balanced  against  the 
expectation  of  success,  so  that  a modest  probability  of  success  after  a short  search  may 
imply  the  same  0"  as  a higher  probability  of  success  after  a longer  search. 
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